(57) Abstract

A method for isolating mRNAs as cDNAs employs a polymerase amplification method using at least two
oligodeoxynucleotide primers. In one approach, the first primer contains sequence capable of hybridizing
to a site immediately upstream of the first A ribonucleotide of the mRNA's polyA tail and the second
primer contains arbitrary sequence. In another approach, the first primer contains sequence capable of
hybridizing to a site including the mRNA's polyA signal sequence and the second primer contains arbitrary
sequence. In another approach, the first primer contains arbitrary sequence and the second primer
contains sequence capable of hybridizing to a site including the Kozak sequence. In another approach, the
first primer contains a sequence that is substantially complementary to the sequence of a mRNA having a
known sequence and the second primer contains arbitrary sequence. In another approach, the first primer
contains arbitrary sequence and the second primer contains sequence that is substantially identical to the
sequence of a mRNA having a known sequence. The first primer is used as a primer for reverse
transcription of the mRNA and the resultant cDNA is amplified with a polymerase using both the first and
second primers as a primer set.
Methods to clone mRNA

Background of the Invention 

This application is a continuation-in-part of the co-pending application U.S.
Serial No. 07/850,343, filed on March 11, 1992.

This invention relates to methods of detecting and cloning individual mRNAs.

The activities of genes in cells are reflected in the kinds and quantities of
their mRNA and protein species. Gene expression is crucial for processes such
as aging, development, differentiation, metabolite production, progression of the
cell cycle, and infectious or genetic or other disease states. Identification of the
expressed mRNAs will be valuable for the elucidation of their molecular
mechanisms, and for applications to the above processes.

Mammalian cells contain approximately 15,000 different mRNA sequences;
however, each mRNA sequence is present at a different frequency within the
cell. Generally, mRNAs are expressed at one of three levels. A few "abundant"
mRNAs are present at about 10,000 copies per cell, about 3,000-4,000
"intermediate" mRNAs are present at 300-500 copies per cell, and about 11,000
"low-abundance" or "rare" mRNAs are present at approximately 15 copies per
cell. The numerous genes that are represented by intermediate and low
frequencies of their mRNAs can be cloned by a variety of well established
techniques (see for example Sambrook et al., 1989, Molecular Cloning: A
Laboratory Manual, Second Edition, Cold Spring Harbor Press, pp. 8.6-8.35).

If some knowledge of the gene sequence or protein is available, several direct
cloning methods can be used. However, if the identity of the desired gene is
unknown, one must be able to select or enrich for the desired gene product in
order to identify the "unknown" gene without expending large amounts of time
and resources.

The identification of unknown genes can often involve the use of
subtractive or differential hybridization techniques. Subtractive hybridization
techniques rely upon the use of very closely related cell populations, such that
differences in gene expression will primarily represent the gene(s) of interest. A
key element of the subtractive hybridization technique is the construction of a
comprehensive complementary-DNA ("cDNA") library.

The construction of a comprehensive cDNA library is now a fairly routine
procedure. PolyA mRNA is prepared from the desired cells and the first strand
of the cDNA is synthesized using RNA-dependent DNA polymerase ("reverse
transcriptase") and an oligodeoxynucleotide primer of 12 to 18 thymidine
residues. The second strand of the cDNA is synthesized by one of several
methods, the more efficient of which are commonly known as "replacement
synthesis" and "primed synthesis".

Replacement synthesis involves the use of ribonuclease H ("RNAase H"),
which cleaves the phosphodiester backbone of RNA that is in a RNA:DNA
hybrid leaving a 3' hydroxyl and a 5' phosphate, to produce nicks and gaps in
the mRNA strand, creating a series of RNA primers that are used by E. coli
DNA polymerase I, or its "Klenow" fragment, to synthesize the second strand of
the cDNA. This reaction is very efficient; however, the cDNAs produced most
often lack the 5' terminus of the mRNA sequence.

Primed synthesis to generate the second cDNA strand is a general name for
several methods which are more difficult than replacement synthesis yet clone the
5' terminal sequences with high efficiency. In general, after the synthesis of the
first cDNA strand, the 3' end of the cDNA strand is extended with terminal
transferase, an enzyme which adds a homopolymeric "tail" of deoxynucleotides,
most commonly deoxycytidylate. This tail is then hybridized to a primer of
oligodeoxyguanidylate or a synthetic fragment of DNA with a deoxyguanidylate
tail and the second strand of the cDNA is synthesized using a DNA-dependent
DNA polymerase.

The primed synthesis method is effective, but the method is laborious, and
all resultant cDNA clones have a tract of deoxyguanidylate immediately upstream
of the mRNA sequence. This deoxyguanidylate tract can interfere with
transcription of the DNA in vitro or in vivo and can interfere with the sequencing
of the clones by the Sanger dideoxynucleotide sequencing method.

Once both cDNA strands have been synthesized, the cDNA library is
constructed by cloning the cDNAs into an appropriate plasmid or viral vector.
In practice this can be done by directly ligating the blunt ends of the cDNAs into
a vector which has been digested by a restriction endonuclease to produce blunt
ends. Blunt end ligations are very inefficient, however, and this is not a
common method of choice. A generally used method involves adding synthetic
linkers or adapters containing restriction endonuclease recognition sequences to
the ends of the cDNAs. The cDNAs can then be cloned into the desired vector
at a greater efficiency.

Once a comprehensive cDNA library is constructed from a cell line, 
desired genes can be identified with the assistance of subtractive hybridization 
(see for example Sargent T.D., 1987, Meth. Enzymol., Vol. 152, pp. 423-432;
Lee et al., 1991, Proc. Natl. Acad. Sci., USA, Vol. 88, pp. 2825-2830). A
general method for subtractive hybridization is as follows. The complementary 
strand of the cDNA is synthesized and radiolabelled. This single strand of 
cDNA can be made from polyA mRNA or from the existing cDNA library. The 
radiolabelled cDNA is hybridized to a large excess of mRNA from a closely 
related cell population. After hybridization the cDNA:mRNA hybrids are 
removed from the solution by chromatography on a hydroxylapatite column. The 
remaining "subtracted" radiolabelled cDNA can then be used to screen a cDNA 
or genomic DNA library of the same cell population. 

Subtractive hybridization removes the majority of the genes expressed in 
both cell populations and thus enriches for genes which are present only in the 
desired cell population. However, if the expression of a particular mRNA
sequence is only a few times more abundant in the desired cell population than
in the subtractive population, it may not be possible to isolate the gene by
subtractive hybridization.

Summary of the Invention 

We have discovered a method for identifying, isolating and cloning
mRNAs as cDNAs using a polymerase amplification method that employs at least
two oligodeoxynucleotide primers. In one approach, the first primer contains
sequence capable of hybridizing to a site including sequence that is immediately
upstream of the first A ribonucleotide of the mRNA's polyA tail and the second
primer contains arbitrary sequence. In another approach, the first primer
contains sequence capable of hybridizing to a site including the mRNA's polyA
signal sequence and the second primer contains arbitrary sequence. In another
approach, the first primer contains arbitrary sequence and the second primer
contains sequence capable of hybridizing to a site including the mRNA's Kozak
sequence. In another approach, the first primer contains a sequence that is
substantially complementary to the sequence of a mRNA having a known
sequence and the second primer contains arbitrary sequence. In another
approach, the first primer contains arbitrary sequence and the second primer
contains sequence that is substantially identical to the sequence of a mRNA
having a known sequence. The first primer is used as a primer for reverse
transcription of the mRNA and the resultant cDNA is amplified with a
polymerase using both the first and second primers as a primer set.

Using this method with different pairs of the alterable primers, virtually
any or all of the mRNAs from any cell type or any stage of the cell cycle,
including very low abundance mRNAs, can be identified and isolated.
Additionally, a comparison of the mRNAs from closely related cells, which may
be for example at different stages of development or different stages of the cell
cycle, can show which of the mRNAs are constitutively expressed and which are
differentially expressed, and their respective frequencies of expression.

The "first primer" or "first oligodeoxynucleotide" as used herein is defined
as being the oligodeoxynucleotide primer that is used for the reverse transcription
of the mRNA to make the first cDNA strand, and then is also used for
amplification of the cDNA. The first primer can also be referred to as the 3'
primer, as this primer will hybridize to the mRNA and will define the 3' end of
the first cDNA strand. The "second primer" as used herein is defined as being
the oligodeoxynucleotide primer that is used to make the second cDNA strand,
and is also used for the amplification of the cDNA. The second primer may also
be referred to as the 5' primer, as this primer will hybridize to the first cDNA
strand and will define the 5' end of the second cDNA strand.

The "arbitrary" sequence of an oligodeoxynucleotide primer as used herein
is defined as being based upon or subject to individual judgement or discretion.
In some instances, the arbitrary sequence can be entirely random or partly
random for one or more bases. In other instances the arbitrary sequence can be
selected to contain a specific ratio of each deoxynucleotide, for example
approximately equal proportions of each deoxynucleotide or predominantly one
deoxynucleotide, or to not contain a specific deoxynucleotide. The arbitrary
sequence can be selected to contain, or not to contain, a recognition site for
a specific restriction endonuclease. The arbitrary sequence can be selected to
either contain a sequence that is substantially identical (at least 50% homologous)
to a mRNA of known sequence or to not contain sequence from a mRNA of
known sequence.

An oligodeoxynucleotide primer can be either "complementary" to a
sequence or "substantially identical" to a sequence. As defined herein, a
complementary oligodeoxynucleotide primer is a primer that contains a sequence
which will hybridize to an mRNA; that is, the bases are complementary to each
other and a reverse transcriptase will be able to extend the primer to form a
cDNA strand of the mRNA. As defined herein, a substantially identical primer
is a primer that contains sequence which is the same as the sequence of an
mRNA, that is, greater than 50% identical, and the primer has the same
orientation as an mRNA. Thus it will not hybridize to, or complement, an mRNA,
but such a primer can be used to hybridize to the first cDNA strand and can be
extended by a polymerase to generate the second cDNA strand. The terms of art
"hybridization" or "hybridize", as used herein, are defined to be the base pairing
of an oligodeoxynucleotide primer with a mRNA or cDNA strand. The
"conditions under which" an oligodeoxynucleotide hybridizes with an mRNA or
a cDNA, as used herein, are defined to be temperature and buffer conditions (that
are described later) under which the base pairing of the oligodeoxynucleotide
primer with either an mRNA or a cDNA occurs and only a few mismatches (one
or two) of the base pairing are permissible.
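
The distinction between a complementary primer and a substantially identical primer can be illustrated with a short Python sketch. The helper names (`reverse_complement`, `percent_identity`, `is_substantially_identical`) and the example segment are hypothetical and are not part of the disclosure. Sequences are written 5' to 3' in DNA letters, and identity is computed position by position over equal-length sequences, which is only one possible reading of "greater than 50% identical".

```python
# Illustrative sketch only; helper names and the example segment are assumptions.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def percent_identity(a: str, b: str) -> float:
    """Percentage of positions at which two equal-length sequences agree."""
    if len(a) != len(b):
        raise ValueError("sequences must be the same length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def is_substantially_identical(primer: str, mrna_segment: str,
                               threshold: float = 50.0) -> bool:
    """True when the primer is more than `threshold` percent identical."""
    return percent_identity(primer, mrna_segment) > threshold


# A segment of an mRNA (sense strand, shown as DNA) chosen for illustration.
mrna_segment = "GCCACCATGG"

# A complementary primer pairs with the mRNA and can prime reverse transcription.
complementary_primer = reverse_complement(mrna_segment)

# A substantially identical primer has the same orientation as the mRNA, so it
# pairs with the first cDNA strand rather than with the mRNA itself.
identical_primer = "GCCACCATCC"  # two mismatches relative to the segment
print(complementary_primer, is_substantially_identical(identical_primer, mrna_segment))
```

In this sketch the complementary primer is derived from the mRNA segment, whereas the substantially identical primer is compared directly against it.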

An oligonucleotide primer can contain a sequence that is known to be a 
"consensus sequence" of an mRNA of known sequence. As defined herein, a 
"consensus sequence" is a sequence that has been found in a gene family of 
proteins having a similar function or similar properties. The use of a primer that 
includes a consensus sequence may result in the cloning of additional members of
a desired gene family.

The "preferred length" of an oligodeoxynucleotide primer, as used herein,
is determined from the desired specificity of annealing and the number of
oligodeoxynucleotides having the desired specificity that are required to hybridize
to all the mRNAs in a cell. An oligodeoxynucleotide primer of 20 nucleotides is
more specific than an oligodeoxynucleotide primer of 10 nucleotides; however,
addition of each random nucleotide to an oligodeoxynucleotide primer increases
fourfold the number of oligodeoxynucleotide primers required in order to
hybridize to every mRNA in a cell.

In one aspect, in general, the invention features a method for identifying
and isolating mRNAs by priming a preparation of mRNA for reverse
transcription with a first oligodeoxynucleotide primer that contains sequence
capable of hybridizing to a site including sequence that is immediately upstream
of the first A ribonucleotide of the mRNA's polyA tail, and amplifying the
cDNA by a polymerase amplification method using the first primer and a second
oligodeoxynucleotide primer, for example a primer having arbitrary sequence, as
a primer set.

In preferred embodiments, the first primer contains at least 1 nucleotide at
the 3' end of the oligodeoxynucleotide that can hybridize to an mRNA sequence
that is immediately upstream of the polyA tail, and contains at least 11
nucleotides at the 5' end that will hybridize to the polyA tail. The entire 3'
oligodeoxynucleotide is preferably at least 13 nucleotides in length, and can be
up to 20 nucleotides in length.

Most preferably, the first primer contains 2 nucleotides at the 3' end of the
oligodeoxynucleotide that can hybridize to an mRNA sequence that is
immediately upstream of the polyA tail. Preferably, the 2 polyA-non-complementary
nucleotides are of the sequence VN, where V is deoxyadenylate
("dA"), deoxyguanylate ("dG"), or deoxycytidylate ("dC"), and N, the 3'
terminal nucleotide, is dA, dG, dC, or deoxythymidylate ("dT"). Thus the
sequence of a preferred first primer is 5'-TTTTTTTTTTTVN [Seq. ID. No. 1].
The use of 2 nucleotides can provide accurate positioning of the first primer at
the junction between the mRNA and its polyA tail, as the properly aligned
oligodeoxynucleotide:mRNA hybrids are more stable than improperly aligned
hybrids, and thus the properly aligned hybrids will form and remain hybridized
at higher temperatures. In preferred applications, the mRNA sample will be
divided into at least twelve aliquots and one of the 12 possible VN sequences of
the first primer will be used in each reaction to prime the reverse transcription of
the mRNA. The use of an oligodeoxynucleotide with a single sequence will
reduce the number of mRNAs to be analyzed in each sample by binding to a
subset of the mRNAs, statistically 1/12th, thus simplifying the identification of
the mRNAs in each sample.
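
The twelve VN-anchored first primers described above can be enumerated with a brief Python sketch. The function names are hypothetical, and the default of 11 thymidylates follows the preferred primer given in the text; the mapping from an mRNA's 3' dinucleotide to its primer assumes perfect base pairing.

```python
# Illustrative sketch only; function names are assumptions, not from the text.
from itertools import product

V_BASES = "ACG"   # V: dA, dG or dC (not dT, which would pair with the polyA tail)
N_BASES = "ACGT"  # N: any of the four deoxynucleotides
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def anchored_primers(n_t: int = 11) -> list:
    """All 5'-T(n)VN-3' primers, one per reverse transcription aliquot."""
    return ["T" * n_t + v + n for v, n in product(V_BASES, N_BASES)]


def primer_for_upstream(dinucleotide: str, n_t: int = 11) -> str:
    """Primer that pairs with the two mRNA bases just upstream of the polyA tail.

    `dinucleotide` is read 5'->3' on the mRNA (U may be given as T).
    """
    dinucleotide = dinucleotide.upper().replace("U", "T")
    if len(dinucleotide) != 2 or any(b not in "ACGT" for b in dinucleotide):
        raise ValueError("expected two bases from A, C, G, T/U")
    if dinucleotide[1] == "A":
        raise ValueError("an A at this position belongs to the polyA tail")
    return "T" * n_t + reverse_complement(dinucleotide)


primers = anchored_primers()
print(len(primers), primers[0], primer_for_upstream("UG"))
```

Each primer in the list corresponds to one aliquot, so an mRNA ending in a given dinucleotide is expected to be primed in only one of the twelve reactions.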

In some embodiments, the 3' end of the first primer can have 1 nucleotide
that can hybridize to an mRNA sequence that is immediately upstream of the
polyA tail, and 12 nucleotides at the 5' end that will hybridize to the polyA tail;
thus the primer will have the sequence 5'-TTTTTTTTTTTTV [Seq. ID. No. 2].
The use of a single non-polyA-complementary deoxynucleotide would decrease
the number of oligodeoxynucleotides that are required to identify every mRNA to
3; however, the use of a single nucleotide to position the annealing of primer to
the junction of the mRNA sequence and the polyA tail may result in a
significant loss of specificity of the annealing, and 2 non-polyA-complementary
nucleotides are preferred.

In some embodiments, the 3' end of the first primer can have 3 or more
nucleotides that can hybridize to an mRNA sequence that is immediately
upstream of the polyA tail. The addition of each nucleotide to the 3' end will
further increase the stability of properly aligned hybrids, and the sequence to
hybridize to the polyA tail can be decreased by one nucleotide for each additional
non-polyA-complementary nucleotide added. The use of such a first primer may
not be practical for rapid screening of the mRNAs contained within a given cell
line, as the use of a first primer with more than 2 nucleotides that hybridize to
the mRNA immediately upstream of the polyA tail significantly increases the
number of oligodeoxynucleotides required to identify every mRNA. For
instance, the primer 5'-TTTTTTTTTTVNN [Seq. ID. No. 3] would require the
use of 48 separate first primers in order to bind to every mRNA, and would
significantly increase the number of reactions required to screen the mRNA from
a given cell line. The use of oligodeoxynucleotides with a single random
nucleotide in one position as a group of four can circumvent the problem of
needing to set up 48 separate reactions in order to identify every mRNA.
However, as the non-polyA-complementary sequence becomes longer, it would
quickly become necessary to increase the number of reactions required to identify
every mRNA.
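
The primer counts quoted above (3, 12 and 48) follow from three choices for V and four for each N. A minimal Python sketch of that arithmetic is given here; the function names are hypothetical, and pooling is modelled simply as grouping the four primers that differ at one N position into a single reaction.

```python
# Illustrative arithmetic only; function names are assumptions.

def anchored_primer_count(anchor_len: int) -> int:
    """Number of distinct V, VN, VNN, ... anchors of the given length."""
    if anchor_len < 1:
        raise ValueError("anchor_len must be at least 1")
    return 3 * 4 ** (anchor_len - 1)


def reactions_needed(anchor_len: int, pooled_positions: int = 0) -> int:
    """Reactions required when `pooled_positions` N positions are each
    supplied as a mixture of four primers in one tube."""
    if not 0 <= pooled_positions <= anchor_len - 1:
        raise ValueError("only N positions can be pooled")
    return anchored_primer_count(anchor_len) // 4 ** pooled_positions


for k in (1, 2, 3):
    print(k, anchored_primer_count(k))
print("VNN with one pooled N:", reactions_needed(3, pooled_positions=1))
```

Under these assumptions, pooling one N position of a VNN primer brings the number of reactions back to the twelve needed for VN primers, while each further anchoring nucleotide multiplies the count by four again.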

In preferred embodiments, the second primer is of arbitrary sequence and
is at least 9 nucleotides in length. Preferably the second primer is at most 13
nucleotides in length and can be up to 20 nucleotides in length.

In another aspect, in general, the invention features a method for preparing
and isolating mRNAs by priming a preparation of mRNA for reverse
transcription with a first primer that contains a sequence capable of hybridizing
to the polyadenylation signal sequence and at least 4 nucleotides that are
positioned 5', or 3', or both of the polyadenylation signal sequence; this entire
first primer is preferably at least 10 nucleotides in length, and can be up to 20
nucleotides in length. In one preferred embodiment the sequence
5'-NNTTTATTNN [Seq. ID. No. 4] can be chosen such that the sequence is
5'-GCTTTATTNC [Seq. ID. No. 5], and the four resultant primers are used
together in a single reaction for the priming of the mRNA for reverse
transcription. Once the first cDNA strand has been formed by reverse
transcription, the first primer can be used with a second primer, for example
an arbitrary sequence primer, for the amplification of the cDNA.
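
As an illustration of how a degenerate primer of this kind resolves into individual sequences, the following Python sketch expands the N positions. The function names are hypothetical; the check against AATAAA rests on the assumption that the polyadenylation signal referred to is the common AAUAAA hexamer, which this passage does not state explicitly.

```python
# Illustrative sketch only; names are assumptions, not part of the disclosure.
from itertools import product

DEGENERATE = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "ACGT", "V": "ACG"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def expand(pattern: str) -> list:
    """List every concrete sequence encoded by a degenerate primer pattern."""
    return ["".join(bases) for bases in product(*(DEGENERATE[c] for c in pattern))]


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


four_primers = expand("GCTTTATTNC")   # one N position gives four primers
all_primers = expand("NNTTTATTNN")    # four N positions

print(four_primers)
print(len(all_primers))
# The fixed core TTTATT is the reverse complement of AATAAA (assumed signal).
print(reverse_complement("TTTATT"))
```

Fixing three of the four N positions, as in GCTTTATTNC, leaves a pool small enough to use together in one reverse transcription reaction.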

In one aspect, in general, the invention features a method for identifying
and isolating mRNAs by priming a preparation of mRNA for reverse
transcription with a first oligodeoxynucleotide primer to generate a first cDNA
strand, and priming the preparation of the second cDNA strand with a second
primer that contains sequence substantially identical to the Kozak sequence of
mRNA, and amplifying the cDNA by a polymerase amplification method using
the first and second primers as a primer set.

In preferred embodiments, the first and second primers are at least 9
deoxynucleotides in length, and are at most 13 nucleotides in length, and can be
up to 20 nucleotides in length. Most preferably the first and second primers are
10 deoxynucleotides in length.

In preferred embodiments the sequence of the first primer is selected at
random, or the first primer contains a selected arbitrary sequence, or the first
primer contains a restriction endonuclease recognition sequence.

In preferred embodiments the sequence of the second primer that contains
sequence substantially identical to the Kozak sequence of mRNA has the
sequence NNNANNATGN [Seq. ID No. 6], or has the sequence
NNNANNATGG [Seq. ID No. 7], where N is any of the four
deoxynucleotides. Preferably, the second primer has the sequence
GCCACCATGG [Seq. ID No. 8]. In some embodiments the first primer may
further include a restriction endonuclease recognition sequence that is added to
either the 5' or 3' end of the primer, increasing the length of the primer by at
least 5 nucleotides.

In another aspect, in general, the invention features a method for
identifying and isolating mRNAs by priming a preparation of mRNA for reverse
transcription with a first oligodeoxynucleotide primer that contains sequence that
is substantially complementary to the sequence of a mRNA having a known
sequence, and priming the preparation of the second cDNA strand with a second
primer, and amplifying the cDNA by a polymerase amplification method using
the first and second primers as a primer set.

In preferred embodiments, the first and second primers are at least 9
deoxynucleotides in length, and are at most 13 nucleotides in length, and can be
up to 20 nucleotides in length. Most preferably the first and second primers are
10 deoxynucleotides in length.

In preferred embodiments the sequence of the first primer further includes
a restriction endonuclease sequence, which may be included within the preferred
10 nucleotides of the primer or may be added to either the 3' or 5' end of the
primer, increasing the length of the oligodeoxynucleotide primer by at least 5
nucleotides.

In preferred embodiments the sequence of the second primer is selected at
random, or the second primer contains a selected arbitrary sequence, or the
second primer contains a restriction endonuclease recognition sequence.

In another aspect, in general, the invention features a method for
identifying and isolating mRNAs by priming a preparation of mRNA for reverse
transcription with a first oligodeoxynucleotide primer, and priming the
preparation of the second cDNA strand with a second primer that contains
sequence that is substantially identical to the sequence of a mRNA having a
known sequence, and amplifying the cDNA by a polymerase amplification
method using the first and second primers as a primer set.

In preferred embodiments, the first and second primers are at least 9
deoxynucleotides in length, and are at most 13 nucleotides in length, and can be
up to 20 nucleotides in length. Most preferably the first and second primers are
10 deoxynucleotides in length.

In preferred embodiments the sequence of the first primer is selected at
random, or the first primer contains a selected arbitrary sequence, or the first
primer contains a restriction endonuclease recognition sequence.

In preferred embodiments the sequence of the second primer having a
sequence that is substantially complementary to the sequence of an mRNA having
a known sequence further includes a restriction endonuclease sequence, which
may be included within the preferred 10 nucleotides of the primer or may be
added to either the 3' or 5' end of the primer, increasing the length of the
oligodeoxynucleotide primer by at least 5 nucleotides.

In another aspect, in general, the invention features a method for
identifying and isolating mRNAs by priming a preparation of mRNA for reverse
transcription with a first oligodeoxynucleotide primer that contains sequence that
is substantially complementary to the sequence of a mRNA having a known
sequence, and priming the preparation of the second cDNA strand with a second
primer that contains sequence that is substantially identical to the Kozak sequence
of mRNA, and amplifying the cDNA by a polymerase amplification method
using the first and second primers as a primer set.

In preferred embodiments, the first and second primers are at least 9
deoxynucleotides in length, and are at most 13 nucleotides in length, and can be
up to 20 nucleotides in length. Most preferably the first and second primers are
10 deoxynucleotides in length.
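
The 10-nucleotide Kozak-type second primers given earlier (NNNANNATGN, NNNANNATGG and GCCACCATGG) can be treated as degenerate patterns and located on a sense-strand sequence. The Python sketch below shows one way to do this with regular expressions; the function names and the short test sequence are hypothetical, and exact matching is assumed even though the text tolerates one or two mismatches.

```python
# Illustrative sketch only; names and the test sequence are assumptions.
import re

BASE_CLASS = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "[ACGT]"}


def pattern_to_regex(pattern: str) -> "re.Pattern":
    """Compile a degenerate primer into a regex that also finds overlapping sites."""
    body = "".join(BASE_CLASS[c] for c in pattern.upper())
    return re.compile("(?=(" + body + "))")


def find_sites(pattern: str, sense_strand: str) -> list:
    """0-based start positions where the primer sequence occurs on the sense
    strand, i.e. where it could pair with the first cDNA strand."""
    return [m.start() for m in pattern_to_regex(pattern).finditer(sense_strand.upper())]


# Made-up sense-strand fragment containing a Kozak-like context around ATG.
fragment = "TTGCCACCATGGCA"
for primer in ("NNNANNATGN", "NNNANNATGG", "GCCACCATGG"):
    print(primer, find_sites(primer, fragment))
```

Because the second primer is substantially identical to the mRNA, the search is run on the sense strand rather than on its complement.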

In some preferred embodiments of each of the general aspects of the 
invention, the amplified cDNAs are separated and then the desired cDNAs are 
reamplified using a polymerase amplification reaction and the first and second 
oligodeoxynucleotide primers. 

In preferred embodiments of each of the general aspects of the invention, a 
set of first and second oligodeoxynucleotide primers can be used, consisting of 
more than one of each primer. In some embodiments more than one of the first 
primer will be included in the reverse transcription reaction and more than one 
each of the first and second primers will be included in the amplification 
reactions. The use of more than one of each primer will increase the number of 
mRNAs identified in each reaction, and the total number of primers to be used 
will be determined based upon the desired method of separating the cDNAs such 
that it remains possible to fully isolate each individual cDNA. In preferred 
embodiments a few hundred cDNAs can be isolated and identified using 
denaturing polyacrylamide gel electrophoresis. 

The method according to the invention is a significant advance over current
cloning techniques that utilize subtractive hybridization. In one aspect, the
method according to the invention enables the genes which are altered in their
frequency of expression, as well as mRNAs which are constitutively and
differentially expressed, to be identified by simple visual inspection and isolated.
In another aspect the method according to the invention provides specific
oligodeoxynucleotide primers for amplification of the desired mRNA as cDNA
and makes unnecessary an intermediary step of adding a homopolymeric tail to
the first cDNA strand for priming of the second cDNA strand, thereby
avoiding any interference from the homopolymeric tail with subsequent analysis
of the isolated gene and its product. In another aspect the method according to
the invention allows the cloning and sequencing of selected mRNAs, so that the
investigator may determine the relative desirability of the gene prior to screening
a comprehensive cDNA library for the full length gene product.



Description of the Preferred Embodiments 



Drawings 



Fig. 1 is a schematic representation of the method according to the
invention.

Fig. 2 is the sequence of the 3' end of the N1 gene from normal mouse
fibroblast cells (A31) [Seq. ID. No. 9].

Fig. 3 is the Northern blot of the N1 sequence on total cellular RNA from
normal and tumorigenic mouse fibroblast cells.

Fig. 4 is a sequencing gel showing the results of amplification for mRNA 
prepared from four sources (lanes 1-4), using the Kozak primer alone, the AP-1 
primer alone, the Kozak and AP-1 primers, the Kozak and AP-2 primers, the 
Kozak and AP-3 primers, the Kozak and AP-4 primers and the Kozak and AP-5
primers. This gel will be more fully described later. 

Fig. 5 is a partial sequence of the 5' end of a clone, Ki, that was cloned
from the A1-5 cell line that was cultured at the non-permissive temperature and
then shifted to the permissive temperature (32.5°C) for 24 h prior to the
preparation of the mRNA. The A1-5 cell line is from a primary rat embryo
fibroblast cell line that has been doubly transformed with ras and a temperature
sensitive mutation of P53 ("P53ts").

General Description, Development of the Method

By way of illustration a description of examples of the method of the 
invention follows, with a description by way of guidance of how the particular 
illustrative examples were developed. 

It is important for operation of the method that the length of the
oligodeoxynucleotide be appropriate for specific hybridization to mRNA. In
order to obtain specific hybridization, whether for conventional cloning methods
or PCR, oligodeoxynucleotides are usually chosen to be 20 or more nucleotides
in length. The use of long oligodeoxynucleotides in this instance would decrease
the number of mRNAs identified during each trial and would greatly increase the
number of oligodeoxynucleotides required to identify every mRNA. Recently, it
was demonstrated that 9-10 nucleotide primers can be used for DNA
polymorphism analysis by PCR (Williams et al., 1991, Nuc. Acids Res., Vol.
18, pp. 6531-6535).
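
The trade-off described above, in which longer primers are more specific but each detects fewer mRNAs, can be sketched numerically in Python. The model is a deliberate simplification and an assumption of this example: bases are treated as independent and equally frequent, only perfect matches are counted, and the function names are hypothetical.

```python
# Illustrative model only; uniform random sequence and exact matching are assumed.

def match_probability(primer_len: int) -> float:
    """Chance that a fixed primer matches at one given position."""
    return 0.25 ** primer_len


def expected_sites(primer_len: int, target_len: int) -> float:
    """Expected number of perfect-match sites in a target of the given length."""
    positions = max(target_len - primer_len + 1, 0)
    return positions * match_probability(primer_len)


def fraction_of_targets_hit(primer_len: int, target_len: int) -> float:
    """Approximate fraction of targets with at least one perfect-match site."""
    positions = max(target_len - primer_len + 1, 0)
    return 1.0 - (1.0 - match_probability(primer_len)) ** positions


for n in (9, 10, 13, 20):
    print(n, expected_sites(n, 1500), fraction_of_targets_hit(n, 1500))
```

Under these assumptions the fraction of targets carrying a site falls by roughly a factor of four for each added nucleotide, which is consistent with the reasoning for preferring primers of about 9 to 10 nucleotides in this method.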



cDNA plasmid") was used as a model template to determine the required lengths 
of oligodeoxynucleotides for specific hybridization to a mRNA, and for the 
production of specific PCR products. The oligodeoxynucleotide primer chosen to 
hybridize internally in the mRNA was varied between 6 and 13 nucleotides in 
length, and the oligodeoxynucleotide primer chosen to hybridize at the upstream 
end of the polyA tail was varied between 7 and 14 nucleotides in length. After 
numerous trials with different sets and lengths of primers, it was determined that
an annealing temperature of 42°C is optimal for product specificity, that the
internally hybridizing oligodeoxynucleotide should be at least 9 nucleotides in
length, and that an oligodeoxynucleotide that is at least 13 nucleotides in length is
required to bind to the upstream end of the polyA tail.

With reference now to Fig. 1, the method according to the invention is
depicted schematically. The mRNAs are mixed with the first primer, for
example TTTTTTTTTTTVN [Seq. ID. No. 2] (T11VN) 1, and reverse
transcribed 2 to make the first cDNA strand. The cDNA is amplified as follows.
The first cDNA strand is added to the second primer and the first primer and the
polymerase in the standard buffer with the appropriate concentrations of
nucleotides, and the components are heated to 94°C to denature the mRNA:cDNA
hybrid 3; the temperature is reduced to 42°C to allow the second primer to
anneal 4, and then the temperature is increased to 72°C to allow the polymerase
to extend the second primer 5. The cycling of the temperature is then repeated
6, 7, 8, to begin the amplification of the sequences which are hybridized by the
first and second primers. The temperature is cycled until the desired number of
copies of each sequence have been made.
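
The temperature cycle just described can be written down as a small data structure in Python. In the sketch below the three temperatures come from the text, the hold times and the figure of 40 cycles are taken from the PCR parameters given in Example 1, and the names (`Step`, `CYCLE`, `ideal_copies`) are hypothetical. Perfect doubling per cycle is an idealized upper bound, not a measured yield.

```python
# Illustrative sketch only; class and function names are assumptions.
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: float
    seconds: int


# One cycle: denature the hybrid, anneal the primers, extend with polymerase.
CYCLE = (
    Step("denature", 94.0, 30),
    Step("anneal", 42.0, 60),
    Step("extend", 72.0, 30),
)


def total_hold_seconds(cycles: int) -> int:
    """Total programmed hold time, ignoring ramping between temperatures."""
    return cycles * sum(step.seconds for step in CYCLE)


def ideal_copies(start_copies: int, cycles: int) -> int:
    """Upper bound on copies if every template were duplicated each cycle."""
    return start_copies * 2 ** cycles


for step in CYCLE:
    print(f"{step.name:9s} {step.temp_c:5.1f} C for {step.seconds} s")
print(total_hold_seconds(40), ideal_copies(1, 10))
```

In practice amplification plateaus well before the ideal figure, so the number of cycles is chosen empirically, as the text indicates.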

As is well known in the art, this amplification method can be accomplished
using thermal stable polymerase or a polymerase that is not thermal stable.
When a polymerase that is not thermal stable is used, fresh polymerase must be
added after the annealing of the primers to the templates at the start of the
elongation or extending step, and the extension step must be carried out at a
temperature that is permissible for the chosen polymerase.

The following examples of the method of the invention are presented for
illustrative purposes only. As will be appreciated, the method according to the
invention can be used for the isolation of polyA mRNA from any source and can
be used to isolate genes expressed either differentially or constitutively at any
level, from rare to abundant.

Example 1

Experimentation with the conditions required for accurate and reproducible
results by PCR was conducted with the TK cDNA plasmid and a single set of
oligodeoxynucleotide primers: the sequence TTTTTTTTTTTCA ("T11CA") [Seq.
ID. No. 10] was chosen to hybridize to the upstream end of the polyA tail and
the sequence CTTGATTGCC ("Ltk3") [Seq. ID. No. 11] was chosen to
hybridize 288 base pairs ("bp") upstream of the polyA tail. The expected
fragment size using these two primers is 299 bp.
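
One reading that is consistent with the two figures given above is that the 288 bp are counted from the 5' end of the Ltk3 site to the first A of the polyA tail, and that the product additionally contains the eleven T residues of the anchored primer. The sketch below simply reproduces that arithmetic; the counting convention and the function name are assumptions made for illustration and are not stated in the text.

```python
ANCHORED_PRIMER = "TTTTTTTTTTTCA"  # Seq. ID No. 10
LTK3_OFFSET_BP = 288               # distance upstream of the polyA tail, from the text


def expected_fragment_size(upstream_offset_bp, anchored_primer):
    """Offset of the internal primer plus the oligo(dT) run of the anchored primer.

    Assumption: the two 3' bases of the anchored primer pair with bases that
    are already counted in `upstream_offset_bp`, so only the leading T run
    extends the product into the polyA tail.
    """
    t_run = len(anchored_primer) - len(anchored_primer.lstrip("T"))
    return upstream_offset_bp + t_run


if __name__ == "__main__":
    print(expected_fragment_size(LTK3_OFFSET_BP, ANCHORED_PRIMER))
```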

PCR was conducted under standard buffer conditions well known in the art
with 10 ng TK cDNA plasmid (buffer and polymerase are available from Perkin
Elmer-Cetus). The standard conditions were altered in that the primers were
used at concentrations of 2.5 µM T11CA [Seq. ID. No. 10] and 0.5 µM Ltk3 [Seq.
ID. No. 11], instead of 1 µM of each primer. The concentration of the
nucleotides ("dNTPs") was also varied over a 100 fold range, from the standard
200 µM to 2 µM. The PCR parameters were 40 cycles of a denaturing step for
30 seconds at 94°C, an annealing step for 1 minute at 42°C, and an extension
step for 30 seconds at 72°C. Significant amounts of non-specific PCR products
were observed when the dNTP concentration was 200 µM; concentrations of
dNTPs at or below 20 µM yielded specifically amplified PCR products. The
specificity of the PCR products was verified by restriction endonuclease digest of
the amplified DNA, which yielded the expected sizes of restriction fragments. In
some instances it was found that the use of up to 5 fold more of the first primer
than the second primer also functioned to increase the specificity of the product.
Lowering the dNTP concentration to 2 µM allowed the labelling of the PCR
products to a high specific activity with [α-35S] dATP (0.5 µM [α-35S] dATP, Sp.
Act. 1200 Ci/mmol), which is necessary for distinguishing the PCR products
when resolved by high resolution denaturing polyacrylamide gel electrophoresis,
in this case a DNA sequencing gel.
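
The reaction parameters of this example can be written down as data, which makes the ratios quoted in the text easy to check. The following is a sketch only: the class and variable names are hypothetical, concentrations are taken to be micromolar, and the time calculation ignores the ramp time between temperatures, which the text does not specify.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: float
    seconds: int


# Cycling parameters as stated in Example 1.
PROGRAM = (
    Step("denature", 94.0, 30),
    Step("anneal", 42.0, 60),
    Step("extend", 72.0, 30),
)
CYCLES = 40

# Concentrations in micromolar (assumed unit for this sketch).
PRIMER_UM = {"anchored primer (Seq. ID No. 10)": 2.5, "Ltk3 (Seq. ID No. 11)": 0.5}
DNTP_RANGE_UM = (200.0, 2.0)  # standard concentration, lowest concentration tested


def total_hold_seconds(program, cycles):
    """Total time spent at the programmed temperatures, excluding ramping."""
    return cycles * sum(step.seconds for step in program)


def fold_difference(high, low):
    """Ratio between two concentrations expressed in the same unit."""
    return high / low


if __name__ == "__main__":
    print(total_hold_seconds(PROGRAM, CYCLES) / 60, "minutes of hold time")
    print(fold_difference(*DNTP_RANGE_UM), "fold dNTP range")
    print(fold_difference(*PRIMER_UM.values()), "fold excess of the first primer")
```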
Example 2 

The PCR method of amplification with short oligodeoxynucleotide primers
was then used to detect a subset of mRNAs in mammalian cells. Total RNAs
and mRNAs were prepared from mouse fibroblast cells which were either
growing normally, "cycling", or serum starved, "quiescent". The RNAs and
mRNAs were reverse transcribed with T11CA [Seq. ID. No. 10] as the primer.
The T11CA primer [Seq. ID. No. 10] was annealed to the mRNA by heating the
mRNA and primer together to 65°C and allowing the mixture to gradually cool
to 35°C. The reverse transcription reaction was carried out with Moloney
murine leukemia virus reverse transcriptase at 35°C. The resultant cDNAs were
amplified by PCR in the presence of T11CA [Seq. ID. No. 10] and Ltk3 [Seq.
ID. No. 11], as described in Example 1, using 2 µM dNTPs. The use of the
T11CA [Seq. ID. No. 10] and Ltk3 [Seq. ID. No. 11] primers allowed the TK
mRNA to be used as an internal control for differential expression of a rare
mRNA transcript; TK mRNA is present at approximately 30 copies per cell.
The DNA sequencing gel revealed 50 to 100 amplified mRNAs in the size range
which is optimal for further analysis, between 100 and 500 nucleotides. The
patterns of the mRNA species observed in cycling and quiescent cells were very
similar, as expected, though some differences were apparent. Notably, the TK
gene mRNA, which is expressed during G1 and S phase, was found only in the
RNA preparations from cycling cells, as expected, thus demonstrating the ability
of this method to separate and isolate rare mRNA species such as TK.
Example 3 

The expression of mRNAs in normal and tumorigenic mouse fibroblast
cells was also compared using the T11CA [Seq. ID. No. 10] and Ltk3 [Seq. ID.
No. 11] primers for the PCR amplification. The mRNA was reverse transcribed
using T11CA [Seq. ID. No. 10] as the primer and the resultant cDNA was
amplified by PCR using 2 µM dNTPs and the PCR parameters described above.
The PCR products were separated on a DNA sequencing gel. The TK mRNA
was present at the same level in both the normal and tumorigenic mRNA
preparations, as expected, and provided a good internal control to demonstrate
the representation of rare mRNA species. Several other bands were present in
one preparation and not in the other, with a few bands present only in the mRNA
from normal cells and a few bands present only in the mRNA from the
tumorigenic cells; some bands were also expressed at different levels in the
normal and tumorigenic cells. Thus, the method according to the invention can
be used to identify genes which are normally continuously expressed
(constitutive), and genes which are differentially expressed, suppressed, or
otherwise altered in their level of expression.

Cloning of the mRNA identified in Example 3

Three cDNAs, namely the TK cDNA, one cDNA expressed only in
normal cells ("N1"), and one cDNA expressed only in tumorigenic cells ("T1"),
were recovered from the DNA sequencing gel by electroelution, ethanol
precipitated to remove the urea and other contaminants, and reamplified by PCR,
in two consecutive PCR amplifications of 40 cycles each, with the primers T11CA
[Seq. ID. No. 10] and Ltk3 [Seq. ID. No. 11] in the presence of 20 µM dNTPs
to achieve optimal yield without compromising the specificity. The reamplified
PCR products were confirmed to have the appropriate sizes and primer
dependencies. As an additional control, the reamplified TK cDNA was digested
with two separate restriction endonucleases and the digestion products were also
confirmed to be of the correct size.

The reamplified N1 [Seq. ID. No. 9] was cloned with the TA cloning
system, Invitrogen Inc., into the plasmid pCR1000 and sequenced. With
reference now to Fig. 2, the nucleotide sequence clearly shows the N1 fragment
[Seq. ID. No. 9] to be flanked by the underlined Ltk3 primer 15 at the 5' end
and the underlined T11CA primer 16 at the 3' end as expected.

A Northern analysis of total cellular RNA using a radiolabelled N1 probe
reconfirmed that the N1 mRNA was only present in the normal mouse fibroblast
cells, and not in the tumorigenic mouse fibroblast cells. With reference now to
Fig. 3, the probe used to detect the mRNA is labelled to the right of the figure,
and the size of the N1 mRNA can be estimated from the 28S and 18S markers
depicted to the left of the figure. The N1 mRNA is present at low abundance in
both exponentially growing and quiescent normal cells, lanes 1 and 3, and is
absent from both exponentially growing and quiescent tumorigenic cells, lanes 2
and 4. As a control, the same Northern blot was reprobed with a radiolabelled
probe for 36B4, a gene that is expressed in both normal and tumorigenic cells, to
demonstrate that equal amounts of mRNA, lanes 1-4, were present on the
Northern blot.
Example 4 

The expression of mRNAs was compared in three cell lines, one of
which was tested after culturing under two different conditions.
The cell lines were a primary rat embryo fibroblast cell line ("REF"), the REF
cell line that has been doubly transformed with ras and a mutant of p53
("T101-4"), and the REF cell line that has been doubly transformed with ras and a
temperature sensitive mutation of p53 ("A1-5"). The A1-5 cell line was cultured
at the non-permissive temperature of 37°C, and also cultured at 37°C then
shifted to the permissive temperature of 32.5°C for 24 h prior to the preparation
of the mRNA. The method of the invention was conducted using the primers
"Kozak" and one of five arbitrary sequence primers, "AP-1, AP-2, AP-3, AP-4,
or AP-5", as the second and first primers, respectively.

The sequence of the "Kozak" primer was chosen based upon the published
consensus sequence for the translation start site of mRNAs
(Kozak, 1991, Jour. Cell Biology, Vol. 115, pp. 887-903). Degenerate Kozak
primers having sequences substantially identical to the translation start site
consensus sequence were used simultaneously. These sequences were
5'-GCCRCCATGG [Seq. ID No. 12], in which the R is dA or dG; each individual
oligodeoxynucleotide has only one of the given nucleotides at this position,
which results in a mixture of primers.
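
The mixture implied by a degenerate primer can be enumerated mechanically. The sketch below uses the standard IUPAC meanings of the ambiguity codes that appear in this document (R, Y, V and N); the function name is illustrative, and inosine substitutions are not modelled.

```python
from itertools import product

# Standard IUPAC nucleotide codes, limited to those used in this document.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG",    # purine
    "Y": "CT",    # pyrimidine
    "V": "ACG",   # not T
    "N": "ACGT",  # any base
}


def expand_degenerate(primer):
    """Return every unambiguous sequence represented by a degenerate primer."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in primer))]


if __name__ == "__main__":
    print(expand_degenerate("GCCRCCATGG"))          # Seq. ID No. 12
    print(len(expand_degenerate("TTTTTTTTTTTVN")))  # anchored primer family
```

For Seq. ID No. 12 the expansion contains two sequences, one of which corresponds to Seq. ID No. 8 (GCCACCATGG).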

The sequences of the five arbitrary primers were as follows: AP-1 had the
sequence 5'-AGCCAGCGAA [Seq. ID. No. 13]; AP-2 had the sequence
5'-GACCGCTTGT [Seq. ID. No. 14]; AP-3 had the sequence
5'-AGGTGACCGT [Seq. ID. No. 15]; AP-4 had the sequence 5'-GGTACTCCAC
[Seq. ID. No. 16]; and AP-5 had the sequence 5'-GTTGCGATCC [Seq. ID. No.
17]. These arbitrary sequence primers were chosen arbitrarily. In general each
arbitrary sequence primer was chosen to have a GC content of 50-70%.
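
The GC content criterion can be verified for the five primers listed above with a few lines of code. This is a minimal sketch; the function names and the way the 50-70% window is encoded are illustrative choices.

```python
ARBITRARY_PRIMERS = {
    "AP-1": "AGCCAGCGAA",  # Seq. ID No. 13
    "AP-2": "GACCGCTTGT",  # Seq. ID No. 14
    "AP-3": "AGGTGACCGT",  # Seq. ID No. 15
    "AP-4": "GGTACTCCAC",  # Seq. ID No. 16
    "AP-5": "GTTGCGATCC",  # Seq. ID No. 17
}


def gc_percent(seq):
    """Percentage of G and C residues in a DNA sequence."""
    gc = sum(1 for base in seq.upper() if base in "GC")
    return 100 * gc / len(seq)


def within_gc_window(seq, low=50.0, high=70.0):
    """True if the GC content lies in the stated 50-70% window."""
    return low <= gc_percent(seq) <= high


if __name__ == "__main__":
    for name, seq in ARBITRARY_PRIMERS.items():
        print(name, seq, gc_percent(seq), within_gc_window(seq))
```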

The mRNA was reverse transcribed using one of the AP primers as the
first primer, and the resultant first cDNA strand was amplified in the presence of
both primers, the AP primer and the degenerate Kozak primer, by PCR using 2
µM dNTPs and the PCR parameters described above. The PCR products were
separated on a DNA sequencing gel. At least 50-100 amplified cDNA bands
were present in each of the cell lines tested, and some bands were expressed at
different levels in the different cell lines. As a control, a reaction was conducted
using each arbitrary primer in the absence of the Kozak primer. No cDNA was
generated by the arbitrary primer alone, thus demonstrating that both primers
were required to amplify an mRNA into a cDNA.

With reference now to Fig. 4, the primer sets used for each reaction are
shown at the top of the figure along the line marked "Primers". As a control, a
reaction was conducted using the primers in the absence of mRNA, and using
AP-1 with mRNA in the absence of the Kozak primer. No cDNA was generated
by the primers in the absence of mRNA or by the arbitrary primer alone, thus
demonstrating that mRNA is required for amplification and that both primers
were required to amplify an mRNA into a cDNA. The cDNA products of the
amplification were loaded in the same order across the gel; thus the REF cell line
is shown in each of lanes 1, cell line T101-4 is shown in each of lanes 2, cell
line A1-5 cultured at 37°C is shown in each of lanes 3, and cell line A1-5
cultured at 32.5°C is shown in each of lanes 4. Each pair of primers resulted in
the amplification of a different set of mRNAs from the cell lines. The reactions
which were conducted using the Kozak primer and any of primers AP-1, AP-2,
AP-4, or AP-5 as a primer set resulted in the amplification of the same cDNA
pattern from each of cell lines REF, T101-4, A1-5 cultured at 37°C and A1-5
cultured at 32.5°C. The amplification of mRNA from each cell line and
temperature using the Kozak degenerate primer and the AP-3 primer resulted in
the finding of one band in particular which was present in the mRNA prepared
from the A1-5 cell line when cultured at 32.5°C for 24 h, and not in any of the
other mRNA preparations, as can be seen in Fig. 4 designated as K1. Thus the
method according to the invention may be used to identify genes which are
differentially expressed in mutant cell lines.
Cloning of the mRNA identified in Example 4

The cDNA ("K1") that was expressed only in the A1-5 cell line when
cultured at 32.5°C was recovered from the DNA sequencing gel and reamplified
using the primers Kozak and AP-3 as described above. The reamplified K1
cDNA was confirmed to have the appropriate size of approximately 450 bp, and
was cloned with the TA cloning system, Invitrogen Inc., into the vector pCRII
(Invitrogen, Inc.) according to the manufacturer's instructions, and sequenced.
With reference now to Fig. 5, the nucleotide sequence clearly shows the K1
clone to be flanked by the underlined Kozak primer 20 at the 5' end and the
underlined AP-3 primer 21 at the 3' end as expected. The 5' end of this partial
cDNA is identified in Seq. ID No. 18, and the 3' end of this cDNA is identified
in Seq. ID No. 19. This partial sequence is an open reading frame, and a search
of the gene databases EMBL and Genbank has revealed the translated amino acid
sequence from the 3' portion of K1 to be homologous to the ubiquitin conjugating
enzyme family (UBC enzyme). The translated amino acid sequence of the 3'
portion of K1 is 100% identical to a UBC enzyme from D. melanogaster,
75% identical to the UBC-4 enzyme and 79% identical to the UBC-5 enzyme
from the yeast S. cerevisiae, and 75% identical to the UBC enzyme from
Arabidopsis thaliana. The K1 clone may contain the actual 5' end of this gene;
otherwise the Kozak primer hybridized just after the 5' end. This result
demonstrates that the method according to the invention can be used to clone the
5' coding sequence of a gene.
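
The statement that the partial sequence is an open reading frame can be illustrated for the 5' end given in Seq. ID No. 18. The sketch below translates from the first ATG using the standard genetic code; it is an illustration only, the function name is hypothetical, and it makes no claim about which search or translation tools were actually used.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Seq. ID No. 18 as given in the sequence listing (spaces for readability only).
SEQ_ID_18 = "GCCGCCATGG CTCTGAAGAG AATCCACAAG GACACCCATG AA".replace(" ", "")


def translate_from_first_atg(seq):
    """Translate complete codons starting at the first ATG, stopping at a stop codon."""
    start = seq.find("ATG")
    if start < 0:
        return ""
    peptide = []
    for pos in range(start, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        peptide.append(residue)
    return "".join(peptide)


if __name__ == "__main__":
    print(translate_from_first_atg(SEQ_ID_18))
```

The ATG used here is the one contained in the Kozak primer site itself, and no stop codon is encountered within the 42 bases listed.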
Use

The method according to the invention can be used to identify, isolate and
clone mRNAs from any number of sources. The method provides for the
identification of desirable mRNAs by simple visual inspection after separation,
and can be used for investigative research, industrial and medical applications.
For instance, the reamplified cDNAs can be sequenced, or used to screen a
DNA library in order to obtain the full length gene. Once the sequence of the
cDNA is known, amino acid peptides can be made from the translated protein
sequence and used to raise antibodies. These antibodies can be used for further
research of the gene product and its function, or can be applied to medical
diagnosis and prognosis. The reamplified cDNAs can be cloned into an
appropriate vector for further propagation, or cloned into an appropriate
expression vector in order to be expressed, either in vitro or in vivo. The
cDNAs which have been cloned into expression vectors can be used in industrial
situations for overproduction of the protein product. In other applications the
reamplified cDNAs or their respective clones will be used as probes for in situ
hybridization. Such probes can also be used for the diagnosis or prognosis of
disease.

Other Embodiments 

Other embodiments are within the following claims. 

The length of the oligodeoxynucleotide can be varied depending upon the
annealing temperature chosen. In the preferred embodiments the temperature
was chosen to be 42°C and the oligonucleotide primers were chosen to be at
least 9 nucleotides in length. If the annealing temperature were decreased to
35°C, the oligonucleotide lengths could be decreased to at least 6 nucleotides
in length.

The cDNA could be radiolabelled with radioactive nucleotides other than
35S, such as 32P and 33P. When desired, non-radioactive imaging methods can
also be applied to the method according to the invention.

The amplification of the cDNA could be accomplished by a temperature
cycling polymerase chain reaction, as was described, using a heat stable DNA
polymerase for the repetitive copying of the cDNA while cycling the temperature
for continuous rounds of denaturation, annealing and extension. Alternatively, the
amplification could be accomplished by an isothermal DNA amplification method
(Walker et al., 1992, Proc. Natl. Acad. Sci., Vol. 89, pp. 392-396). The
isothermal amplification method would be adapted for amplifying cDNA
by including an appropriate restriction endonuclease sequence, one that will be
nicked at hemiphosphorothioate recognition sites and whose recognition site can
be regenerated during synthesis with 35S labelled dNTPs.

Proteins having similar function or similar functional domains are often
referred to as being part of a gene family. Many such proteins have been cloned
and identified to contain consensus sequences which are highly conserved
amongst the members of the family. This conservation of sequence can be used
to design oligodeoxynucleotide primers for the cloning of new members, or
related members, of a family. Using the method of the invention the mRNA
from a cell can be reverse transcribed, and a cDNA could be amplified using at
least one primer that has a sequence substantially identical to the sequence of a
mRNA of known sequence. Consensus sequences for at least the following
families and functional domains have been described in the literature: protein
tyrosine kinases (Hanks et al., 1991, Methods in Enzymology, Vol. 200, pp. 38-81;
Wilks, 1991, Methods in Enzymology, Vol. 200, pp. 533-546); homeobox
genes; zinc-finger DNA binding proteins (Miller et al., 1985, EMBO Jour., Vol.
4, pp. 1609-1614); receptor proteins; the signal peptide sequence of secreted
proteins; proteins that localize to the nucleus (Guiochon-Mantel et al., 1989,
Cell, Vol. 57, pp. 1147-1154); serine proteases; inhibitors of serine proteases;
cytokines; the SH2 and SH3 domains that have been described in tyrosine kinases
and other proteins (Pawson et al., 1992, Cell, Vol. 71, pp. 359-362);
serine/threonine and tyrosine phosphatases (Cohen, 1991, Methods in
Enzymology, Vol. 201, pp. 398-408); cyclins and cyclin-dependent protein
kinases (CDKs) (see, for example, Keyomarsi et al., 1993, Proc. Natl. Acad. Sci.
USA, Vol. 90, pp. 1112-1116).

Primers for any consensus sequence can readily be designed based upon the
codon usage of the amino acids. The incorporation of degeneracy at one or more
sites allows the designing of a primer which will hybridize to a high percentage,
greater than 50%, of the mRNAs containing the desired consensus sequence.
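
How much degeneracy a given amino acid consensus implies can be estimated by counting synonymous codons. The sketch below does this with the standard genetic code; it is an illustration under that assumption only, the function name is hypothetical, and it does not take organism-specific codon usage into account.

```python
from collections import Counter

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
# (first, second and third base each cycling through T, C, A, G).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Number of codons that encode each amino acid ("*" marks stop codons).
CODONS_PER_RESIDUE = Counter(AMINO)


def coding_degeneracy(peptide):
    """Number of distinct nucleotide sequences that can encode `peptide`."""
    total = 1
    for residue in peptide.upper():
        if residue == "*" or residue not in CODONS_PER_RESIDUE:
            raise ValueError(f"not a standard amino acid: {residue!r}")
        total *= CODONS_PER_RESIDUE[residue]
    return total


if __name__ == "__main__":
    for consensus in ("PYVC", "MW", "L"):
        print(consensus, coding_degeneracy(consensus))
```

A consensus rich in residues with few codons (for example M or W) needs little or no degeneracy in the primer, whereas residues with four or six codons are the positions where a mixed base or a universally pairing base is most useful.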

Primers for use in the method according to the invention could be designed
based upon the consensus sequence of the zinc finger DNA binding proteins, for
example, based upon the amino acid consensus sequence of the proteins PYVC.
Useful primers for the cloning of further members of this family can have the
following sequences: 5'-GTAYGCNTGT [Seq. ID. No. 20] or
5'-GTAYGCNTGC [Seq. ID. No. 21], in which the Y refers to the
deoxynucleotides dT or dC for which the primer is degenerate at this position,
and the N refers to inosine ("I"). The base inosine can pair with all of the
other bases, and was chosen for this position of the oligodeoxynucleotide as the
codon for valine "V" is highly degenerate in this position. The described
oligodeoxynucleotide primers as used will be a mixture of 5'-GTATGCITGT
and 5'-GTACGCITGT or a mixture of 5'-GTATGCITGC and
5'-GTACGCITGC.

SEQUENCE LISTING

(1) GENERAL INFORMATION:
  (i) APPLICANT: Liang, Peng
                 Pardee, Arthur B.
  (ii) TITLE OF INVENTION: Identifying, Isolating and Cloning Messenger RNAs
  (iii) NUMBER OF SEQUENCES: 21
  (iv) CORRESPONDENCE ADDRESS:
    (A) ADDRESSEE: Choate, Hall & Stewart
    (B) STREET: Exchange Place, 53 State Street
    (C) CITY: Boston
    (D) STATE: Massachusetts
    (E) COUNTRY: U.S.A.
    (F) ZIP: 02190
  (v) COMPUTER READABLE FORM:
    (A) MEDIUM TYPE: Floppy disk
    (B) COMPUTER: IBM PC compatible
    (C) OPERATING SYSTEM: PC-DOS/MS-DOS
    (D) SOFTWARE: PatentIn Release #1.0, Version #1.25
  (vi) CURRENT APPLICATION DATA:
    (A) APPLICATION NUMBER: US
    (B) FILING DATE: 11-MAR-1993
    (C) CLASSIFICATION:
  (vii) PRIOR APPLICATION DATA:
    (A) APPLICATION NUMBER: US 07/850,343
    (B) FILING DATE: 11-MAR-1992
  (viii) ATTORNEY/AGENT INFORMATION:
    (A) NAME: Pasternack, Sam
    (B) REGISTRATION NUMBER: 29,576
    (C) REFERENCE/DOCKET NUMBER: DFCI234CIP
  (ix) TELECOMMUNICATION INFORMATION:
    (A) TELEPHONE: 617 227-5020
    (B) TELEFAX: 617 227-7566

(2) INFORMATION FOR SEQ ID NO: 1:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 13 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
TTTTTTTTTT TVN

(2) INFORMATION FOR SEQ ID NO: 2:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 13 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
TTTTTTTTTT TTV

(2) INFORMATION FOR SEQ ID NO: 3:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 13 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
TTTTTTTTTT VNN

(2) INFORMATION FOR SEQ ID NO: 4:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
NNTTTATTNN

(2) INFORMATION FOR SEQ ID NO: 5:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GCTTTATTNC

(2) INFORMATION FOR SEQ ID NO: 6:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
NNNANNATGN 10

(2) INFORMATION FOR SEQ ID NO: 7:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
NNNANNATGG 10

(2) INFORMATION FOR SEQ ID NO: 8:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GCCACCATGG 10

(2) INFORMATION FOR SEQ ID NO: 9:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 260 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: cDNA
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
CTTGATTGCC TCCTACAGCA GTTGCAGGCA CCTTTAGCTG TACCATGAAG TTCACAGTCC 60
GGGATTGTGA CCCTAATACT GGAGTTCCAG ATGAAGATGG ATATGATGAT GAATATGTGC 120
TGGAAGATCT TGAGGTAACT GTGTCTGATC ATATTCAGAA GATACTAAAA CCTAACTTCG 180
CTGCTGCCTG GGAAGAGGTG GGAGGAGCAG CTGCGACAGA GCGTCCTCTT CACAGAGGGG 240
TCCTGGGTGA AAAAAAAAAA 260

(2) INFORMATION FOR SEQ ID NO: 10:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 13 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
TTTTTTTTTT TCA

(2) INFORMATION FOR SEQ ID NO: 11:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
CTTGATTGCC

(2) INFORMATION FOR SEQ ID NO: 12:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GCCRCCATGG

(2) INFORMATION FOR SEQ ID NO: 13:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
AGCCAGCGAA

(2) INFORMATION FOR SEQ ID NO: 14:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
GACCGCTTGT

(2) INFORMATION FOR SEQ ID NO: 15:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
AGGTGACCGT

(2) INFORMATION FOR SEQ ID NO: 16:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
GGTACTCCAC

(2) INFORMATION FOR SEQ ID NO: 17:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
GTTGCGATCC

(2) INFORMATION FOR SEQ ID NO: 18:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 42 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: cDNA
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
GCCGCCATGG CTCTGAAGAG AATCCACAAG GACACCCATG AA 42

(2) INFORMATION FOR SEQ ID NO: 19:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 78 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: cDNA
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
GTTGCATTTA CAACAAGAAT TTATCATCCA AATATTAACA GTAATGGCAG CATTTGTCTT 60
GATATTCTAC GGTCACCT 78

(2) INFORMATION FOR SEQ ID NO: 20:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
GTAYGCNTGT 10

(2) INFORMATION FOR SEQ ID NO: 21:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: other nucleic acid
  (iii) HYPOTHETICAL: NO
  (iv) ANTI-SENSE: NO
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
GTAYGCNTGC 10

Claims 

1. A non-specific cloning method for isolating in a nucleic acid sample a 
DNA complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide under conditions in 
which said first oligodeoxynucleotide hybridizes with mRNA at a site including a 
sequence immediately upstream of a first A ribonucleotide of the mRNA's polyA 
tail, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from the site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeoxynucleotide hybridizes with DNA, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from the site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a DNA polymerase, 
using said first and second oligodeoxynucleotides as primers. 

2. The method of claim 1 wherein said first oligodeoxynucleotide 
hybridizes with the mRNA at a site that includes at least one base upstream from 
and adjacent to the first A ribonucleotide of the polyA tail. 

3. The method of claim 2 wherein said first oligodeoxynucleotide 
hybridizes with the mRNA at a site that includes at least two bases upstream 
from and adjacent to the first A ribonucleotide of the polyA tail. 

4. The method of claim 1 wherein said first oligodeoxynucleotide
includes a polyA-complementary region comprising at least 11 bases and,
upstream from said polyA-complementary region, a non-polyA-complementary
region comprising at least one base.

5. The method of claim 4 wherein said non-polyA-complementary
region comprises at least 2 contiguous bases.

6. The method of claim 5 wherein said non-polyA-complementary
region comprises 3'-NV, wherein V is one of dA, dC or dG, and N is one of
dA, dT, dC or dG.

7. The method of claim 4 wherein said first oligodeoxynucleotide 
comprises at least 13 bases. 

8. The method of claim 1 wherein said second oligodeoxynucleotide
comprises at least 6 deoxyribonucleotides.

9. The method of claim 1 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

10. The method of claim 1 wherein said second oligodeoxynucleotide 
includes a randomly selected nucleotide sequence. 

11. The method of claim 1 wherein said first or second
oligodeoxynucleotide includes a selected arbitrary sequence. 

12. The method of claim 1 wherein said first or the second 
oligodeoxynucleotide includes dC, dG, dT and dA. 

13. The method of claim 1 wherein said first or second 
oligodeoxynucleotide includes a restriction endonuclease recognition sequence. 

14. The method of claim 1 wherein said second oligodeoxynucleotide 
includes a sequence identical to a sequence contained within a mRNA of known 
sequence. 

15. The method of claim 1 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides. 

16. A non-specific cloning method for isolating in a nucleic acid sample a 
DNA complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide under conditions in
which said first oligodeoxynucleotide hybridizes with mRNA at a site that
includes the mRNA's polyA signal sequence,

reverse transcribing the mRNA using a reverse transcriptase, using said
first oligodeoxynucleotide as a primer, to produce a first DNA strand
complementary to at least a portion of the mRNA upstream from the site of
hybridization of said first oligodeoxynucleotide with the mRNA,

contacting the first DNA strand with a second oligodeoxynucleotide under
conditions in which said second oligodeoxynucleotide hybridizes with DNA,

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from the site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a DNA polymerase, 
using said first and second oligodeoxynucleotides as primers. 

17. The method of claim 16 wherein said first oligodeoxynucleotide 
comprises at least 6 nucleotides. 

18. The method of claim 16 wherein said first oligodeoxynucleotide 
comprises at least 9 nucleotides. 

19. The method of claim 16 wherein said second oligodeoxynucleotide 
comprises at least 6 deoxyribonucleotides. 

20. The method of claim 16 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

21. The method of claim 16 wherein said second oligodeoxynucleotide 
includes a randomly selected nucleotide sequence. 

22. The method of claim 16 wherein said first or second 
oligodeoxynucleotide includes a selected arbitrary sequence. 

23. The method of claim 16 wherein said first or the second 
oligodeoxynucleotide includes dC, dG, dT and dA. 

24. The method of claim 16 wherein said first or second 
oligodeoxynucleotide includes a restriction endonuclease recognition sequence. 

25. The method of claim 16 wherein said second oligodeoxynucleotide 
includes a sequence identical to a sequence contained within a mRNA of known 
sequence. 

26. The method of claim 16 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides. 

27. A method for isolating in a nucleic acid sample a DNA 
complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide under conditions in
which said first oligodeoxynucleotide hybridizes with mRNA at a site, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from said site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeoxynucleotide hybridizes with the DNA 
strand at a site, said site including a Kozak sequence, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from the site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a polymerase, using 
said first and second oligodeoxynucleotides as primers. 

28. The method of claim 27 wherein said first oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

29. The method of claim 27 wherein said first oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

30. The method of claim 27 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

31. The method of claim 27 wherein said second oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

32. The method of claim 27 wherein said first oligodeoxynucleotide is 
composed of a randomly selected sequence of deoxyribonucleotides. 

33. The method of claim 27 wherein said first oligodeoxynucleotide 
includes a selected arbitrary sequence of deoxyribonucleotides. 

34. The method of claim 27 wherein said first oligodeoxynucleotide 
includes a restriction endonuclease recognition sequence. 

35. The method of claim 27 wherein said first oligodeoxynucleotide 
includes a sequence substantially identical to a sequence contained within an 
mRNA of known sequence. 

36. The method of claim 27 wherein said second oligodeoxynucleotide
further includes a restriction endonuclease sequence. 

37. The method of claim 27 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides.

38. A non-specific cloning method for isolating in a nucleic acid sample
a DNA complementary to a mRNA, comprising

contacting the mRNA with a first oligodeoxynucleotide, having a base 
sequence substantially complementary to a sequence in a mRNA of known 
sequence, under conditions in which said first oligodeoxynucleotide hybridizes 
with mRNA at a site having said substantially identical sequence, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from said site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeoxynucleotide hybridizes with the DNA 
strand at a site, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from said site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a polymerase, using 
said first and second oligodeoxynucleotides as primers. 

39. The method of claim 38 wherein said first oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

40. The method of claim 38 wherein said first oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

41. The method of claim 38 wherein said second oligodeoxynucleotide
comprises at least 9 deoxyribonucleotides.

42. The method of claim 38 wherein said second oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

43. The method of claim 38 wherein said first oligodeoxynucleotide
further includes a restriction endonuclease sequence. 

44. The method of claim 38 wherein said second oligodeoxynucleotide 
is composed of a randomly selected sequence of deoxyribonucleotides. 

45. The method of claim 38 wherein said second oligodeoxynucleotide 
includes a selected arbitrary sequence of deoxyribonucleotides. 

46. The method of claim 38 wherein the base sequence of said second 
oligodeoxynucleotide contains a restriction endonuclease recognition sequence. 

47. The method of claim 38 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides. 

48. A non-specific cloning method for isolating in a nucleic acid sample 
a DNA complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide under conditions in 
which said first oligodeoxynucleotide hybridizes with mRNA at a site, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from said site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide, having
a sequence substantially identical to a sequence in a mRNA of known sequence, 
under conditions in which said second oligodeoxynucleotide hybridizes with the 
first DNA strand at a site containing a complement of said substantially identical 
sequence, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from said site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a polymerase, using 
said first and second oligodeoxynucleotides as primers. 

49. The method of claim 48 wherein said first oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

50. The method of claim 48 wherein said first oligodeoxynucleotide
comprises 10 deoxyribonucleotides. 

51. The method of claim 48 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

52. The method of claim 48 wherein said second oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

53. The method of claim 48 wherein said first oligodeoxynucleotide is 
composed of a randomly selected sequence of deoxyribonucleotides. 

54. The method of claim 48 wherein said first oligodeoxynucleotide 
includes a selected arbitrary sequence of deoxyribonucleotides. 

55. The method of claim 48 wherein said first oligodeoxynucleotide 
contains a restriction endonuclease recognition sequence. 

56. The method of claim 48 wherein said second oligodeoxynucleotide 
further includes a restriction endonuclease sequence. 

57. The method of claim 48 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides. 

58. A non-specific cloning method for isolating in a nucleic acid sample 
a DNA complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide, having a sequence 
substantially complementary to a sequence in a mRNA of known sequence, under 
conditions in which said first oligodeoxynucleotide hybridizes with mRNA at a 
site containing said substantially identical sequence, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from said site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeoxynucleotide hybridizes with the DNA 
strand at a site, said site including a Kozak sequence, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from said site of hybridization of said second oligodeoxynucleotide
with said first DNA strand, and 

amplifying the first and second DNA strands using a polymerase, using 
said first and second oligodeoxynucleotides as primers. 

59. The method of claim 58 wherein said first oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

60. The method of claim 58 wherein said first oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

61. The method of claim 58 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

62. The method of claim 58 wherein said second oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 



WO 93/18176    PCT/US93/02246

1/4



FIG. 1

[Schematic not reproduced. Recoverable labels: mRNA strand 5' -NBAAAAAAAAAAA An; first primer -NVTTTTTTTTTTT-5'; second primer 5'-NNNNNNNNN-.]
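The degenerate notation in FIG. 1 can be made concrete with a short sketch. It assumes the standard IUPAC ambiguity codes (N = A, C, G or T; V = A, C or G; B = C, G or T) and reads the first primer, drawn 3' to 5' in the figure, as 5'-TTTTTTTTTTTVN-3'. The function names `expand` and `reverse_complement` are illustrative only and do not come from the specification.

```python
from itertools import product

# IUPAC ambiguity codes needed for the FIG. 1 notation (assumed standard meanings).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "V": "ACG", "B": "CGT", "N": "ACGT"}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def expand(degenerate):
    """List every concrete sequence covered by a degenerate sequence."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in degenerate))]


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return "".join(COMPLEMENT[b] for b in reversed(seq))


# First primer written 5'->3': eleven T residues followed by V and N.
primers = expand("T" * 11 + "VN")

# Sites (written 5'->3' as DNA) to which those primers are fully complementary.
sites = {reverse_complement(p) for p in primers}

print(len(primers), "primers in the degenerate set")
print(sites == set(expand("NB" + "A" * 11)))
```

Under these assumptions the primer set has twelve members, and the sites they match are exactly the twelve sequences written NB followed by eleven A residues, which is the mRNA notation used in the figure.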



SUBSTITUTE SHEET 






FIG. 2 



        10         20         30         40         50         60
CTTGATTGCC TCCTACAGCA GTTGCAGGCA CCTTTAGCTG TACCATGAAG TTCACAGTCC

        70         80         90        100        110        120
GGGATTGTGA CCCTAATACT GGAGTTCCAG ATGAAGATGG ATATGATGAT GAATATGTGC

       130        140        150        160        170        180
TGGAAGATCT TGAGGTAACT GTGTCTGATC ATATTCAGAA GATACTAAAA CCTAACTTCG

       190        200        210        220        230        240
CTGCTGCCTG GGAAGAGGTG GGAGGAGCAG CTGCGACAGA GCGTCCTCTT CACAGAGGGG

       250        260
TCCTGGGTGA AAAAAAAAAA

M6
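As a consistency check on the FIG. 2 listing, the following sketch joins the ten-base blocks in position order and inspects the 3' end. It assumes that the blocks are read in the order given by the printed position numbers (10 through 260) and that the listing is the mRNA-sense strand; the helper names are illustrative and not part of the specification.

```python
# Ten-base blocks of FIG. 2, in the order of their position numbers (assumption).
BLOCKS = """
CTTGATTGCC TCCTACAGCA GTTGCAGGCA CCTTTAGCTG TACCATGAAG TTCACAGTCC
GGGATTGTGA CCCTAATACT GGAGTTCCAG ATGAAGATGG ATATGATGAT GAATATGTGC
TGGAAGATCT TGAGGTAACT GTGTCTGATC ATATTCAGAA GATACTAAAA CCTAACTTCG
CTGCTGCCTG GGAAGAGGTG GGAGGAGCAG CTGCGACAGA GCGTCCTCTT CACAGAGGGG
TCCTGGGTGA AAAAAAAAAA
""".split()

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return "".join(COMPLEMENT[b] for b in reversed(seq))


def trailing_a_run(seq):
    """Count the A residues at the 3' end of the sequence."""
    return len(seq) - len(seq.rstrip("A"))


sequence = "".join(BLOCKS)
tail = trailing_a_run(sequence)

# Two bases immediately upstream of the terminal A run.
anchor = sequence[-tail - 2:-tail]

# Primer (5'->3') complementary to those two bases plus eleven A residues.
primer = reverse_complement(anchor + "A" * 11)

print(len(sequence), tail, anchor, primer)
```

The assembled length agrees with the last printed position number, which suggests that no block was lost when the columns were rejoined. The derived primer is only an illustration of how an anchored primer relates to a listed 3' end, not a statement about which primer was actually used.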



FIG. 3 



[Image not reproduced. Recoverable labels: lanes 1 2 3 4; markers 28S and 18S; N1; 36B4.]






FIG. 4

[Image not reproduced. Recoverable labels: primer pairs KOZAK with AP-1, AP-2, AP-3, AP-4 and AP-5; lanes 1 2 3 4 under each primer pair. The labels of the upper panel (PRIMERS, KOZAK, AP-1) are only partly legible.]







5'-GCgACCATGGCTCTGAAGAGAATCCACAAGGACACCCATGAA
   Kozak

GTTGCATTTACAACAAGAA

TTTATCATCCAAATATTAACAGTAATGGCAGCATTTGTCTTGATATTCTACGGTCACCT-3'
                                              3'-TGCCAGTGGA-5'
                                                 AP-3



FIG. 5 
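The base pairing drawn in FIG. 5 can be checked with a small sketch. It assumes that AP-3 is drawn 3' to 5' beneath the 3' end of the upper strand, and it uses only the last ten bases of that strand as printed; the variable names are illustrative.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return "".join(COMPLEMENT[b] for b in reversed(seq))


# Last ten bases of the upper strand in FIG. 5, written 5'->3'.
upper_3prime_end = "ACGGTCACCT"

# AP-3 as drawn in the figure, left to right, which is 3'->5'.
ap3_as_drawn = "TGCCAGTGGA"

# Reverse the drawn string to obtain AP-3 in the conventional 5'->3' direction.
ap3 = ap3_as_drawn[::-1]

print(ap3)
print(ap3 == reverse_complement(upper_3prime_end))
```

Under these assumptions AP-3 is the exact reverse complement of the final ten bases of the upper strand, which is consistent with the alignment shown in the figure.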






INTERNATIONAL SEARCH REPORT

International application No.
PCT/US93/02246



A. CLASSIFICATION OF SUBJECT MATTER 
IPC(5) :C12P 19/34 
US CL :435/91 

According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 435/6,91;935/21 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
APS, CA, BIOSIS, MEDLINE 



C. DOCUMENTS CONSIDERED TO BE RELEVANT



Category*    Citation of document, with indication, where appropriate, of the relevant passages    Relevant to claim No.

M. Innis et al., "PCR PROTOCOLS, A GUIDE TO METHODS
AND APPLICATIONS", published 1990 by ACADEMIC PRESS,
INC. (CALIFORNIA), see pages 60-69.
    Relevant to claim No.: 1-62

US, A, 4,683,195 (MULLIS ET AL) 28 JULY 1987, see column 6,
lines 44-55, column 10, lines 47-57, claim 10.
    Relevant to claim No.: 1-62

D. FREIFELDER, "MOLECULAR BIOLOGY, A
COMPREHENSIVE INTRODUCTION TO PROKARYOTES AND
EUKARYOTES", published 1983 by JONES AND BARTLETT
PUBLISHERS, Inc. (BOSTON), see pages 402-404.
    Relevant to claim No.: 16-26



□ Further documents are listed in the continuation of Box C.    □ See patent family annex.



Special categories of cited documents:

document defining the general state of the art which is not considered
to be part of particular relevance

earlier document published on or after the international filing date

document which may throw doubts on priority claim(s) or which is
cited to establish the publication date of another citation or other
special reason (as specified)

document referring to an oral disclosure, use, exhibition or other
means

document published prior to the international filing date but later than
the priority date claimed

later document published after the international filing date or priority
date and not in conflict with the application but cited to understand the
principle or theory underlying the invention

document of particular relevance; the claimed invention cannot be
considered novel or cannot be considered to involve an inventive step
when the document is taken alone

document of particular relevance; the claimed invention cannot be
considered to involve an inventive step when the document is
combined with one or more other such documents, such combination
being obvious to a person skilled in the art

document member of the same patent family



Date of the actual completion of the international search 
29 April 1993 



Date of mailing of the international search report

17 MAY 1 




Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks
Box PCT 

Washington, D.C. 20231 
Facsimile No. NOT APPLICABLE 



Authorized officer 

PAUL B. TRAN, PH.D. 
Telephone No. (703) 308-0196 



Form PCT/ISA/210 (second sheet)(July 1992)* 






